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AB Genomic clones for the largest human neurofilament protein (NF-H) 
were isolated, the intron/exon boundaries mapped, and the entire 
protein-coding regions (exons) sequenced. The predicted protein 
contains a central region that obeys the structural criteria 
identified for a-helical •rod 1 domains typically present in 
all intermediate filament (IF) protein components; it is .apprx.310 
amino acids long, shares amino acid sequence homol . with other IF 
protein rod domains, and displays the characteristic heptad repeats 
of apolar amino acids which facilitate coiled-coil interaction. 
Nevertheless, anomalies are noted in the structure of the F-H rod 
which could explain observations of its poor homopolymeric assembly 
in vitro. The protein segment on the carboxy-terminal side of the 
human NF-H rod is uniquely long (>600 amino acids) compared to other 
IF proteins and is highly charged (>24% Glu, >25% Lys), rich in 
proline (>12%), and impoverished in cysteine, methionine, and aromatic 
amino acids. Its most remarkable feature is repetitive sequence 
that covers more than half its length and includes the sequence 
motif, Lys-Ser-Pro (KSP) , >40 times. Together with the 
identification of the serine in KSP as the main target for 
NF-directed protein kinases in vivo, this repetitive character 
explains the massive phosphorylation of the NF-H subunit that can 
occur in axons. The human NF-H gene has three introns, two of which 
interrupt the protein-coding sequence at identical points to introns 
in the genes for the two smaller NF proteins, NF-M and NF-L. Both 
these introns differ from any of the several introns that have a 
common organization in all other (non-neural) IF genes. However, a 
clear evolutionary relationship between neural and non-neural IFs is 
now revealed by the observation that the third intron in the NF-H 
gene matches the position of one of the conserved introns in the 
non-neural IF gene pattern. Hence, divergence of the two IF 
lineages (neural from non-neural) is more likely to have involved 
ancestral IF gene duplication rather than RNA-mediated 
transposition . 

IT 119213-37-5, Phosphoprotein NF-H (human clone HW10/HW12 
subunit protein moiety reduced) 
RL: PRP (Properties) 

(amino acid sequence of) 
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611252-43-8/BI OR 612112-12-6/BI) 
L4 27 LI AND L3 

L4 ANSWER 1 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 612112-12-6 REGISTRY 

CN Protein (human heart clone GenBank gi: 1083508 9 mitochondria- 
associated) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 1115: PN: WO03087768 SEQID: 1115 claimed protein 
CI MAN 
SQL 1026 



SEQ 1 MMS FGGADAL LGAPFAPLHG GGSLHYALAR KGGAGGTRSA AGSSSGFHSW 

51 TRTSVSSVSA SPSRFRGAGA ASSTDSLDTL SNGPEGCMVA VATSRSEKEQ 

101 LQALNDRFAG YIDKVRQLEA HNRSLEGEAA ALRQQQAGRS AMGELYEREV 

151 REMRGAVLRL GAARGQLRLE QEHLLEDIAH VRQRLDDEAR QREEAEAAAR 

201 ALARFAQEAE AARVDLQKKA QALQEECGYL RRHHQEEVGE LLGQIQGSGA 

251 AQAQMQAETR DALKCDVTSA L RE IRAQ LEG HAVQSTLQSE EWFRVRLDRL 

301 SEAAKVNTDA MRSAQEEITE YRRQLQARTT ELEALKSTKD SLERQRSELE 

351 DRHQADIASY QEAIQQLDAE LRNTKWEMAA QLREYQDLLN VKMALDIEIA 

401 AYRKLLEGEE CRIGFGPIPF SLPEGLPKIP SVSTHIKVKS EEKIKWEKS 

451 EKETVIVEEQ TEETQVTEEV TEEEEKEAKE EEGKEEEGGE EEEAEGGEEE 

501 TKSPPAEEAA SPEKEAKSPV KEEAKSPAEA KSPEKEEAKS PAEVKSPEKA 

551 KSPAKEEAKS PPEAKSPEKE EAKSPAEVKS PEKAKSPAKE EAKSPAEAKS 

601 PEKAKSPVKE EAKSPAEAKS PVKEEAKSPA EVKSPEKAKS PTKEEAKSPE 

651 KAKSPEKAKS PEKEEAKSPE KAKSPVKAEA KSPEKAKSPV KAEAKSPEKA 

701 KSPVKEEAKS PEKAKSPVKE EAKSPEKAKS PVKEEAKTPE KAKSPVKEEA 

751 KSPEKAKSPE KAKTLDVKSP EAKTPAKEEA RSPADKFPEK AKSPVKEEVK 



8 01 SPEKAKSPLK EDAKAPEKEI PKKEEVKSPV KEEEKPQEVK VKEPPKKAEE 
851 EKAPATPKTE EKKDSKKEEA PKKEAPKPKV EEKKEPAVEK PKESKVEAKK 
901 EEAEDKKKVP TPEKEAPAKV EVKEDAKPKE KTEVAKKEPD DAKAKEPSKP 
951 AEKKEAAPEK KDTKEEKAKK PEEKPKTEAK AKEDDKTLSK EPSKPKAEKA 
1001 EKSSSTDQKD SKPPEKATED KAAKGK 
HITS AT: 700-731, 742-759 

** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 



REFERENCE 1: 139:333132 



L4 ANSWER 2 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 611252-43-8 REGISTRY 

CN Protein (human clone US20030194704-SEQID-32114 exon-derived 

fragment) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 114: PN: US20030194704 SEQID: 32114 claimed protein 
CI MAN 
SQL 617 

SEQ 1 KLLEGEECRI GFGPIPFSLP EGLPKIPSVS THIKVKSEEK IKWEKSEKE 

51 TVIVEEQTEE TQVTEEVTEE EEKEAKEEEG KEEEGGEEEE AEGGEEETKS 
101 PPAEEAASPE KEAKSPVKEE AKSPAEAKSP EKEEAKSPAE VKSPEKAKSP 
151 AKEEAKSPPE AKSPEKEEAK SPAEVKSPEK AKSPAKEEAK SPAEAKSPEK 
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201 AKSPVKEEAK SPAEAKSPVK EEAKSPAEVK SPEKAKSPTK EEAKSPEKAK 
251 SPEKEEAKSP EKAKSPVKAE AKSPEKAKSP VKAEAKSPEK AKSPVKEEAK 



301 SPEKAKSPVK EEAKSPEKAK SPVKEEAKTP EKAKSPVKEE AKSPEKAKSP 



351 EKAKTLDVKS PEAKTPAKEE ARSPADKFPE KAKSPVKEEV KSPEKAKSPL 
401 KADAKAPEKE IPKKEEVKSP VKEEEKPQEV KVKEPPKKAE EEKAPATPKT 
451 EEKKDSKKEE APKKEAPKPK VEEKKEPAVE KPKESKVEAK KEEAEDKKKV 
501 PTPEKEAPAK VEVKEDAKPK EKTEVAKKEP DDAKAKEPSK PAEKKEAAPE 
551 KKDTKEEKAK KPEEKPKTEA KAKEDDKTLS KEPSKPKAEK AEKSSSTDQK 
601 DSKPPEKATE DKAAKGK 
HITS AT: 291-322, 333-350 

** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 



REFERENCE 1: 139:303033 



L4 ANSWER 3 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 538458-06-9 REGISTRY 

CN Pain-regulated protein (human clone WO03016475-SEQID-9347 ) (9CI) 

(CA INDEX NAME) 
OTHER NAMES: 

CN 2255: PN: WO03016475 SEQID : 9347 claimed protein 
CI MAN 
SQL 1020 

SEQ 1 MMSFGGADAL LGAPFAPLHG GGSLHYALAR KGGAGGTRSA AGSSSGFHSW 

51 TRTSVSSVSA SPSRFRGAGA ASSTDSLDTL SNGPEGCMVA VATSRSEKEQ 
101 LQALNDRFAG YIDKVRQLEA HNRSLEGEAA ALRQQQAGRS AMGELYEREV 
151 REMRGAVLRL GAARGQLRLE QEHLLEDIAH VRQRLDDEAR QREEAEAAAR 
201 ALARFAQEAE AARVDLQKKA QALQEECGYL RRHHQEEVGE LLGQIQGSGA 
251 AQAQMQAETR DALKCDVTSA LREIRAQLEG HAVQSTLQSE EWFRVRLDRL 
301 SEAAKVNTDA MRSAQEEITE YRRQLQARTT ELEALKSTKD SLERQRSELE 
351 DRHQADIASY QEAIQQLDAE LRNTKWEMAA QLREYQDLLN VKMALDIEIA 
401 AYRKLLEGEE CRIGFGPIPF SLPEGLPKIP SVSTHIKVKS EEKIKWEKS 
451 EKETVIVEEQ TEETQVTEEV TEEEEKEAKE EEGKEEEGGE EEEAEGGEEE 
501 TKSPPAEEAA SPEKEAKSPV KEEAKSPAEA KSPEKEEAKS PAEVKSPEKA 
551 KSPAKEEAKS PPEAKSPEKE EAKSPAEVKS PEKAKSPAKE EAKSPAEAKS 
601 PEKAKSPVKE EAKSPAEAKS PVKEEAKSPA EVKSPEKAKS PTKEEAKSPE 
651 KAKSPEKEEA KSPEKAKSPV KAEAKSPEKA KSPVKAEAKS PEKAKSPVKE 



701 EAKSPEKAKS PVKEEAKSPE KAKSPVKEEA KTPEKAKSPV KEEAKSPEKA 



751 KSPEKAKTLD VKSPEAKTPA KEEARSPADK FPEKAKSPVK EEVKSPEKAK 

801 SPLKADAKAP EKEIPKKEEV KSPVKEEEKP QEVKVKEPPK KAEEEKAPAT 
851 PKTEEKKDSK KEEAPKKEAP KPKVEEKKEP AVEKPKESKV EAKKEEAEDK 
901 KKVPTPEKEA PAKVEVKEDA KPKEKTEVAK KEPDDAKAKE PSKPAEKKEA 
951 APEKKDTKEE KAKKPEEKPK TEAKAKEDDK TLSKEPSKPK AEKAEKSSST 
1001 DQKDSKPPEK ATE DKAAKGK 
HITS AT: 694-725, 736-753 



** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 



REFERENCE 1: 139:18399 



Searcher 
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L4 ANSWER 4 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 538425-66-0 REGISTRY 

CN Pain-regulated protein (human clone WO03016475-SEQID-4798 ) (9CI) 

(CA INDEX NAME) 
OTHER NAMES: 

CN 2214: PN: WO03016475 SEQID: 4798 claimed protein 
CI MAN 
SQL 1020 



SEQ 1 MMSFGGADAL LGAPFAPLHG GGSLHYALAR KGGAGGTRSA AGSSSGFHSW 

51 TRTSVSSVSA SPSRFRGAGA ASSTDSLDTL SNGPEGCMVA VATSRSEKEQ 

101 LQALNDRFAG YIDKVRQLEA HNRSLEGEAA ALRQQQAGRS AMGELYEREV 

151 REMRGAVLRL GAARGQLRLE QEHLLEDIAH VRQRLDDEAR QREEAEAAAR 

201 ALARFAQEAE AARVDLQKKA QALQEECGYL RRHHQEEVGE LLGQIQGSGA 

251 AQAQMQAETR DALKCDVTSA LREIRAQLEG HAVQSTLQSE EWFRVRLDRL 

301 SEAAKVNTDA MRSAQEEITE YRRQLQARTT ELEALKSTKD SLERQRSELE 

351 DRHQADIASY QEAIQQLDAE LRNTKWEMAA QLREYQDLLN VKMALDIEIA 

401 AYRKLLEGEE CRIGFGPIPF SLPEGLPKIP SVSTHIKVKS EEKIKWEKS 

451 EKETVIVEEQ TEETQVTEEV TEEEEKEAKE EEGKEEEGGE EEEAEGGEEE 

501 TKSPPAEEAA SPEKEAKSPV KEEAKSPAEA KSPEKEEAKS PAEVKSPEKA 

551 KSPAKEEAKS P PEAKS P EKE EAKSPAEVKS PEKAKS PAKE EAKSPAEAKS 

601 PEKAKS PVKE EAKSPAEAKS PVKEEAKSPA EVKS PEKAKS PTKEEAKSPE 

651 KAKSPEKEEA KSPEKAKSPV KAEAKSPEKA KSPVKAEAKS PEKAKS PVKE 



7 01 EAKS PEKAKS PVKEEAKSPE KAKSPVKEEA KTPEKAKSPV KEEAKSPEKA 



751 KSPEKAKTLD VKSPEAKTPA KEEARS PADK FPEKAKSPVK EEVKSPEKAK 

SPLKADAKAP EKEIPKKEEV KSPVKEEEKP QEVKVKEPPK KAEEEKAPAT 
PKTEEKKDSK KEEAPKKEAP KPKVEEKKEP AVEKPKESKV EAKKEEAEDK 
KKVPTPEKEA PAKVEVKEDA KPKEKTEVAK KEPDDAKAKE PSKPAEKKEA 
APEKKDTKEE KAKKPEEKPK TEAKAKEDDK TLSKEPSKPK AEKAEKSSST 
DQKDSKPPEK ATEDKAAKGK 
694-725, 736-753 



801 
851 
901 
951 
1001 
HITS AT: 



** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 



REFERENCE 1: 139:18398 

L4 ANSWER 5 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 528612-99-9 REGISTRY 

CN Protein (human gene CG4399 sequence homolog) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 130: PN: WO03040301 PAGE: 197-198 claimed protein 
CN Putative neurofilament protein (human) 
CI MAN 
SQL 1026 



SEQ 1 MMSFGGADAL LGAPFAPLHG GGSLHYALAR KGGAGGTRSA AGSSSGFHSW 

51 TRTSVSSVSA SPSRFRGAGA ASSTDSLDTL SNGPEGCMVA VATSRSEKEQ 
101 LQALNDRFAG YIDKVRQLEA HNRSLEGEAA ALRQQQAGRS AMGELYEREV 
151 REMRGAVLRL GAARGQLRLE QEHLLEDIAH VRQRLDDEAR QREEAEAAAR 
201 ALARFAQEAE AARVDLQKKA QALQEECGYL RRHHQEEVGE LLGQIQGSGA 
251 AQAQMQAETR DALKCDVTSA LREIRAQLEG HAVQSTLQSE EWFRVRLDRL 
301 SEAAKVNTDA MRSAQEEITE YRRQLQARTT ELEALKSTKD SLERQRSELE 
351 DRHQADIASY QEAIQQLDAE IRNTKWEMAA QLREYQDLLN VKMALDIEIA 
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401 AYRKLLEGEE CRIGFGPIPF SLPEGLPKIP SVSTHIKVKS EEKIKWEKS 
451 EKETVIVEEQ TEETQVTEEV TEEEEKEAKE EEGKEEEGGE EEEAEGGEEE 
501 TKSPPAEEAA SPEKEAKSPV KEEAKSPAEA KSPEKEEAKS PAEVKSPEKA 
551 KSPAKEEAKS PPEAKSPEKE EAKSPAEVKS PEKAKSPAKE EAKSPAEAKS 
601 PEKAKSPVKE EAKSPAEAKS PVKEEAKSPA EVKSPEKAKS PTKEEAKSPE 
651 KAKSPEKAKS PEKEEAKSPE KAKS PVKAEA KSPEKAKSPV KAEAKSPEKA 

701 KSPVKEEAKS PEKAKSPVKE EAKSPEKAKS PVKEEAKTPE KAKSPVKEEA 



751 KSPEKAKSPE KAKTLDVKSP EAKTPAKEEA RS PADKFPEK AKSPVKEEVK 



801 SPEKAKSPLK EDAKAPEKEI PKKEEVKSPV KEEEKPQEVK VKEPPKKAEE 
851 EKAPATPKTE EKKDSKKEEA PKKEAPKPKV EEKKEPAVEK PKESKVEAKK 
901 EEAEDKKKVP TPEKEAPAKV EVKEDAKPKE KTEVAKKEPD DAKAKEPSKP 
951 AEKKEAAPEK KDTKEEKAKK PEEKPKTEAK AKEDDKTLSK EPSKPKAEKA 
1001 EKSSSTDQKD SKPPEKATED KAAKGK 
HITS AT: 700-731, 742-759 



REFERENCE 1: 138:397302 



L4 ANSWER 6 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 437023-11-5 REGISTRY 

CN Protein (human clone US2002004 8763-SEQID-36182 exon-encoded 

fragment) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 1177: PN: US20020048763 SEQID: 36182 claimed protein 
CI MAN 
SQL 617 



SEQ 1 KLLEGEECRI GFGPIPFSLP EGLPKIPSVS THIKVKSEEK IKWEKSEKE 

51 TVIVEEQTEE TQVTEEVTEE EEKEAKEEEG KEEEGGEEEE AEGGEEETKS 

101 PPAEEAASPE KEAKSPVKEE AKSPAEAKSP EKEEAKSPAE VKSPEKAKSP 

151 AKEEAKSPPE AKSPEKEEAK SPAEVKSPEK AKSPAKEEAK SPAEAKSPEK 

201 AKSPVKEEAK SPAEAKSPVK EEAKSPAEVK SPEKAKSPTK EEAKSPEKAK 

251 SPEKEEAKSP EKAKSPVKAE AKSPEKAKSP VKAEAKSPEK AKSPVKEEAK 



301 SPEKAKSPVK EEAKSPEKAK SPVKEEAKTP EKAKSPVKEE AKSPEKAKSP 



351 EKAKTLDVKS PEAKTPAKEE ARSPADKFPE KAKSPVKEEV KSPEKAKSPL 
4 01 KADAKAPEKE IPKKEEVKSP VKEEEKPQEV KVKEPPKKAE EEKAPATPKT 
451 EEKKDSKKEE APKKEAPKPK VEEKKEPAVE KPKESKVEAK KEEAEDKKKV 
501 PTPEKEAPAK VEVKEDAKPK EKTEVAKKEP DDAKAKEPSK PAEKKEAAPE 
551 KKDTKEEKAK KPEEKPKTEA KAKEDDKTLS KEPSKPKAEK AEKSSSTDQK 
601 DSKPPEKATE D KAAKGK 
HITS AT: 291-322, 333-350 

* * RELATED SEQUENCES AVAILABLE WITH SEQLINK** 



REFERENCE 1: 137:28982 

L4 ANSWER 7 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 420909-41-7 REGISTRY 

CN Protein (human clone WO01057273-SEQID-28953 exon-encoded fragment) 

(9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 2760: PN: WO0157273 SEQID: 28953 claimed protein 



Searcher 



Shears 
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CI MAN 
SQL 617 

SEQ 1 KLLEGEECRI 

51 TVIVEEQTEE 

101 PPAEEAASPE 

151 AKEEAKSPPE 

201 AKSPVKEEAK 

251 SPEKEEAKSP 



GFGPIPFSLP EGLPKIPSVS 
TQVTEEVTEE EEKEAKEEEG 
KEAKSPVKEE AKSPAEAKSP 
AKSPEKEEAK SPAEVKSPEK 
SPAEAKSPVK EEAKSPAEVK 
EKAKS PVKAE AKSPEKAKSP 



THIKVKSEEK IKWEKSEKE 
KEEEGGEEEE AEGGEEETKS 
EKEEAKSPAE VKSPEKAKSP 
AKSPAKEEAK SPAEAKSPEK 
SPEKAKSPTK EEAKSPEKAK 
VKAEAKSPEK AKSPVKEEAK 



301 SPEKAKSPVK EEAKSPEKAK SPVKEEAKTP EKAKSPVKEE AKSPEKAKSP 

351 EKAKTLDVKS PEAKTPAKEE ARSPADKFPE KAKSPVKEEV KSPEKAKSPL 
401 KADAKAPEKE IPKKEEVKSP VKEEEKPQEV KVKEPPKKAE EEKAPATPKT 
451 EEKKDSKKEE APKKEAPKPK VEEKKEPAVE KPKESKVEAK KEEAEDKKKV 
501 PTPEKEAPAK VEVKEDAKPK EKTEVAKKEP DDAKAKEPSK PAEKKEAAPE 
551 KKDTKEEKAK KPEEKPKTEA KAKEDDKTLS KEPSKPKAEK AEKSSSTDQK 
601 DSKPPEKATE DKAAKGK 
HITS AT: 291-322, 333-350 



* * RELATED SEQUENCES AVAILABLE WITH SEQLINK** 



REFERENCE 1: 136:351355 



COPYRIGHT 2004 ACS on STN 



L4 ANSWER 8 OF 27 REGISTRY 
RN 419604-01-6 REGISTRY 

CN Protein (human cervix cell clone WO0157278-SEQID-21284 exon-encoded 

fragment) (9CI) ■ (CA INDEX NAME) 
OTHER NAMES: 

CN 1884: PN: WO0157278 SEQID: 21284 claimed protein 
CI MAN 
SQL 617 



SEQ 1 KLLEGEECRI GFGPIPFSLP EGLPKIPSVS THIKVKSEEK IKWEKSEKE 

51 TVIVEEQTEE TQVTEEVTEE EEKEAKEEEG KEEEGGEEEE AEGGEEETKS 

101 PPAEEAASPE KEAKSPVKEE AKSPAEAKSP EKEEAKSPAE VKSPEKAKSP 

151 AKEEAKSPPE AKSPEKEEAK SPAEVKSPEK AKSPAKEEAK SPAEAKSPEK 

201 AKSPVKEEAK SPAEAKSPVK EEAKSPAEVK SPEKAKSPTK EEAKSPEKAK 

251 SPEKEEAKSP EKAKS PVKAE AKSPEKAKSP VKAEAKSPEK AKSPVKEEAK 



301 SPEKAKSPVK EEAKSPEKAK SPVKEEAKTP EKAKSPVKEE AKSPEKAKSP 

351 EKAKTLDVKS PEAKTPAKEE ARSPADKFPE KAKSPVKEEV KSPEKAKSPL 
401 KADAKAPEKE IPKKEEVKSP VKEEEKPQEV KVKEPPKKAE EEKAPATPKT 
451 EEKKDSKKEE APKKEAPKPK VEEKKEPAVE KPKESKVEAK KEEAEDKKKV 
501 PTPEKEAPAK VEVKEDAKPK EKTEVAKKEP DDAKAKEPSK PAEKKEAAPE 
551 KKDTKEEKAK KPEEKPKTEA KAKEDDKTLS KEPSKPKAEK AEKSSSTDQK 
601 DSKPPEKATE DKAAKGK 
HITS AT: 291-322, 333-350 

** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
REFERENCE 1: 136:351347 

L4 ANSWER 9 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 409392-10-5 REGISTRY 



Searcher 



Shears 



571-272-2528 



09/847586 



CN Protein (human brain clone WO0157275-SEQID-28374 exon-encoded 

fragment) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 2394: PN: WO0157275 SEQID: 28374 claimed protein 
CI MAN 
SQL 617 



SEQ 1 KLLEGEECRI GFGPIPFSLP EGLPKIPSVS THIKVKSEEK IKWEKSEKE 

51 TVIVEEQTEE TQVTEEVTEE EEKEAKEEEG KEEEGGEEEE AEGGEEETKS 
101 PPAEEAASPE KEAKSPVKEE AKSPAEAKSP EKEEAKSPAE VKSPEKAKSP 
151 AKEEAKSPPE AKSPEKEEAK SPAEVKSPEK AKSPAKEEAK SPAEAKSPEK 
201 AKSPVKEEAK SPAEAKSPVK EEAKSPAEVK SPEKAKSPTK EEAKSPEKAK 
251 SPEKEEAKSP EKAKSPVKAE AKSPEKAKSP VKAEAKSPEK AKSPVKEEAK 



301 SPEKAKSPVK EEAKSPEKAK SPVKEEAKTP EKAKS PVKEE AKSPEKAKSP 



351 EKAKTLDVKS PEAKTPAKEE ARSPADKFPE KAKS PVKEEV KSPEKAKSPL 
4 01 KADAKAPEKE IPKKEEVKSP VKEEEKPQEV KVKEPPKKAE EEKAPATPKT 
451 EEKKDSKKEE APKKEAPKPK VEEKKEPAVE KPKESKVEAK KEEAEDKKKV 
501 PTPEKEAPAK VEVKEDAKPK EKTEVAKKEP DDAKAKEPSK PAEKKEAAPE 
551 KKDTKEEKAK KPEEKPKTEA KAKEDDKTLS KEPSKPKAEK AEKSSSTDQK 
601 DSKPPEKATE DKAAKGK 
HITS AT: 291-322, 333-350 

** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
REFERENCE 1: 136:305084 

L4 ANSWER 10 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 400154-00-9 REGISTRY 

CN Protein (human fetal liver clone WO0157277-SEQID-28080 exon-encoded 

fragment) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 2295: PN: WO0157277 SEQID: 28080 claimed protein 
CI MAN 
SQL 617 



SEQ 1 KLLEGEECRI GFGPIPFSLP EGLPKIPSVS THIKVKSEEK IKWEKSEKE 

51 TVIVEEQTEE TQVTEEVTEE EEKEAKEEEG KEEEGGEEEE AEGGEEETKS 
101 PPAEEAASPE KEAKSPVKEE AKSPAEAKSP EKEEAKSPAE VKSPEKAKSP 
151 AKEEAKSPPE AKSPEKEEAK SPAEVKSPEK AKSPAKEEAK SPAEAKSPEK 
201 AKSPVKEEAK SPAEAKSPVK EEAKSPAEVK SPEKAKSPTK EEAKSPEKAK 
251 SPEKEEAKSP EKAKSPVKAE AKSPEKAKSP VKAEAKSPEK AKSPVKEEAK 



301 SPEKAKSPVK EEAKSPEKAK SPVKEEAKTP EKAKS PVKEE AKSPEKAKSP 



351 EKAKTLDVKS PEAKTPAKEE 
401 KADAKAPEKE IPKKEEVKSP 
451 EEKKDSKKEE APKKEAPKPK 
501 PTPEKEAPAK VEVKEDAKPK 
551 KKDTKEEKAK KPEEKPKTEA 
601 DSKPPEKATE DKAAKGK 
HITS AT: 291-322, 333-350 



ARSPADKFPE KAKS PVKEEV KSPEKAKSPL 
VKEEEKPQEV KVKEPPKKAE EEKAPATPKT 
VEEKKEPAVE KPKESKVEAK KEEAEDKKKV 
EKTEVAKKEP DDAKAKEPSK PAEKKEAAPE 
KAKEDDKTLS KEPSKPKAEK AEKSSSTDQK 



** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
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REFERENCE 1: 136:178933 



L4 ANSWER 11 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 400113-81-7 REGISTRY 

CN Protein (human clone WO0157274-SEQID-22654 exon-encoded fragment) 

(9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 2061: PN: WO0157274 SEQID: 22654 claimed protein 
CI MAN 
SQL 617 

SEQ 1 KLLEGEECRI GFGPIPFSLP EGLPKIPSVS THIKVKSEEK IKWEKSEKE 

51 TVIVEEQTEE TQVTEEVTEE EEKEAKEEEG KEEEGGEEEE AEGGEEETKS 
101 PPAEEAASPE KEAKSPVKEE AKSPAEAKSP EKEEAKSPAE VKSPEKAKSP 
151 AKEEAKSPPE AKSPEKEEAK SPAEVKSPEK AKSPAKEEAK SPAEAKSPEK 
201 AKS PVKEEAK SPAEAKSPVK EEAKS PAEVK SPEKAKSPTK EEAKSPEKAK 
251 SPEKEEAKSP EKAKSPVKAE AKSPEKAKSP VKAEAKSPEK AKS PVKEEAK 



301 SPEKAKSPVK EEAKSPEKAK SPVKEEAKTP EKAKSPVKEE AKSPEKAKSP 



351 : EKAKTLDVKS PEAKTPAKEE ARSPADKFPE KAKS PVKEEV KSPEKAKSPL 
401 KADAKAPEKE IPKKEEVKSP VKEEEKPQEV KVKEPPKKAE EEKAPATPKT 
451 EEKKDSKKEE APKKEAPKPK VEEKKEPAVE KPKESKVEAK KEEAEDKKKV 
501 PTPEKEAPAK VEVKEDAKPK EKTEVAKKEP DDAKAKEPSK PAEKKEAAPE 
551 KKDTKEEKAK KPEEKPKTEA KAKEDDKTLS KEPSKPKAEK AEKSSSTDQK 
601 DSKPPEKATE DKAAKGK 
HITS AT: 291-322, 333-350 



** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 



REFERENCE 



1: 136:195264 



L4 ANSWER 12 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 391971-89-4 REGISTRY • 

CN Heavy neurofilament subunit (human gene NF-H) (9CI) 
OTHER NAMES: 



CN 4211: PN: WO03038130 

CN GenBank CAA33366 

CN GenBank CAA33366 (Translated from 

CI MAN 

SQL 1020 



FIGURE: 3 claimed protein 

GenBank X15306) 



(CA INDEX NAME) 



SEQ 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 



MMS FGGADAL 
TRTSVSSVSA 
LQALNDRFAG 
REMRGAVLRL 
ALARFAQEAE 
AQAQMQAETR 
SEAAKVNTDA 
DRHQADIASY 
AYRKLLEGEE 
EKETVIVEEQ 
TKSPPAEEAA 
KSPAKEEAKS 
PEKAKSPVKE 
KAKSPEKEEA 



LGAPFAPLHG 
SPSRFRGAGA 
YIDKVRQLEA 
GAARGQLRLE 
AARVDLQKKA 
DALKCDVTSA 
MRSAQEEITE 
QEAIQQLDAE 
CRIGFGPIPF 
TEETQVTEEV 
SPEKEAKSPV 
PPEAKSPEKE 
EAKSPAEAKS 
KSPEKAKSPV 



GGSLHYALAR 
ASSTDSLDTL 
HNRSLEGEAA 
QEHLLEDIAH 
QALQEECGYL 
LREIRAQLEG 
YRRQLQARTT 
LRNTKWEMAA 
SLPEGLPKIP 
TEEEEKEAKE 
KEEAKSPAEA 
EAKSPAEVKS 
PVKEEAK SPA 
KAEAKSPEKA 



KGGAGGTRSA 
SNGPEGCMVA 
ALRQQQAGRS 
VRQRLDDEAR 
RRHHQEEVGE 
HAVQSTLQSE 
ELEALKSTKD 
QLREYQDLLN 
SVSTHIKVKS 
EEGKEEEGGE 
KSPEKEEAKS 
PEKAKSPAKE 
EVKSPEKAKS 
KSPVKAEAKS 



AGSSSGFHSW 
VATSRSEKEQ 
AMGELYEREV 
QREEAEAAAR 
LLGQIQGSGA 
EWFRVRLDRL 
SLERQRSELE 
VKMALDIEIA 
EEKIKWEKS 
EEEAEGGEEE 
PAEVKSPEKA 
EAKSPAEAKS 
PTKEEAKSPE 
PEKAKSPVKE 



Searcher 



Shears 



571-272-2528 
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701 EAKSPEKAKS PVKEEAKSPE KAKSPVKEEA KTPEKAKSPV KEEAKSPEKA 



751 KSPEKAKTLD VKSPEAKTPA KE EARS PAD K FPEKAKSPVK EEVKSPEKAK 

801 SPLKADAKAP EKEIPKKEEV KSPVKEEEKP QEVKVKEPPK KAEEEKAPAT 
851 PKTEEKKDSK KEEAPKKEAP KPKVEEKKEP AVEKPKESKV EAKKEEAEDK 
901 KKVPTPEKEA PAKVEVKEDA KPKEKTEVAK KEPDDAKAKE PSKPAEKKEA 
951 APEKKDTKEE KAKKPEEKPK TEAKAKEDDK TLSKEPSKPK AEKAEKSSST 
1001 DQKDSKPPEK ATEDKAAKGK 
HITS AT: 694-725, 736-753 



** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 



REFERENCE 1: 138:380506 



REFERENCE 2: 136:146104 

L4 ANSWER 13 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 374653-90-4 REGISTRY 

CN Protein {human HBLl 00 cell clone WO0157270-SEQID-12927 exon-encoded 

fragment) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 2869: PN: WO0157270 SEQID: 12927 claimed protein 
CI MAN 
SQL 617 



SEQ 1 KLLEGEECRI GFGPIPFSLP EGLPKIPSVS THIKVKSEEK IKWEKSEKE 

51 TVIVEEQTEE TQVTEEVTEE EEKEAKEEEG KEEEGGEEEE AEGGEEETKS 

101 PPAEEAASPE KEAKSPVKEE AKSPAEAKSP EKEEAKSPAE VKSPEKAKSP 

151 AKEEAKSPPE AKSPEKEEAK SPAEVKSPEK AKSPAKEEAK SPAEAKSPEK 

201 AKSPVKEEAK SPAEAKSPVK EEAKSPAEVK SPEKAKSPTK EEAKSPEKAK 

251 SPEKEEAKSP EKAKSPVKAE AKSPEKAKSP VKAEAKSPEK AKSPVKEEAK 



301 SPEKAKSPVK EEAKSPEKAK SPVKEEAKTP EKAKSPVKEE AKSPEKAKSP 



351 EKAKTLDVKS PEAKTPAKEE ARSPADKFPE KAKSPVKEEV KSPEKAKSPL 
401 KADAKAPEKE IPKKEEVKSP VKEEEKPQEV KVKEPPKKAE EEKAPATPKT 
451 EEKKDSKKEE APKKEAPKPK VEEKKEPAVE KPKESKVEAK KEEAEDKKKV 
501 PTPEKEAPAK VEVKEDAKPK EKTEVAKKEP DDAKAKEPSK PAEKKEAAPE 
551 KKDTKEEKAK KPEEKPKTEA KAKEDDKTLS KEPSKPKAEK AEKSSSTDQK 
601 DSKPPEKATE DKAAKGK 
HITS AT: 291-322, 333-350 



** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 



REFERENCE 1: 136:1595 



L4 ANSWER 14 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 355043-60-6 REGISTRY 

CN Protein {human clone 787CIP2B_152 contig-encoded fragment) (9CI) 

(CA INDEX NAME) 
OTHER NAMES: 

CN 338: PN: WO0157190 SEQID: 3455 claimed sequence 
CI MAN 
SQL 1033 



Searcher 



Shears 



571-272-2528 



09/847586 



SEQ 1 APTAQAMMSF GGADALLGAP FAPLHGGGSL HYALARKGGA GGTRSAAGSS 

51 SGFHSWTRTS VSSVSASPSR FRGAGAASST DSLDTLSNGP EGCMVAVATS 

101 RSEKEQLQAL NDRFAGYIDK VRQLEAHNRS LEGEAAALRQ QQAGRSAMGE 

151 LYEREVREMR GAVLRLGAAR GQLRLEQEHL LEDIAHVRQR LDDEARQREE 

201 AEAAARALAR FAQEAEAARV DLQKKAQALQ EECGYLRRHH QEEVGELLGQ 

251 IQGSGAAQAQ MQAETRDALK C DVT SAL RE I RAQLEGHAVQ STLQSEEWFR 

301 VRLDRLSEAA KVNTDAMRSA QEEITEYRRQ LQARTTELEA LKSTKDSLER 

351 QRSELEDRHQ ADIASYQEAI QQLDAELRNT KWEMAAQLRE YQDLLNVKMA 

401 LDIEIAAYRK LLEGEECRIG FGPIPFSLPE GLPKIPSVST HIKVKSEEKI 

451 KWEKSEKET VIVEEQTEET QVTEEVTEEE DKEAKEEEGK EEEGGEEEEA 

501 EGGEEETKSP PAEEAASPEK EAKSPVKEEA KSPAEAKSPE KEEAKSPAEV 

551 KSPEKAKSPA KEEAKSPPEA KSPEKDGKQN FQAEVKSPEK AKSPAKEEAK 

601 SPAEAKSPEK AKSPVKEEAK SPAEAKSPVK EEAKSPAEVK SPEKAKSPTK 

651 EEAKSPEKAK SPEKAKSPEK EEAKSPEKAK SPVKAEAKSP EKAKSPVKAE 

701 AKSPEKAKSP VKEEAKSPEK AKSPVKEEAK SPEKAKSPVK EEAKTPEKAK 



751 SPVKEEAKSP EKAKSPEKAK TLDVKSPEAK TPAKEEARSP ADKFPEKAKS 



801 PVKEEVKSPE KAKSPLKEDA KAPEKEIPKK EEVKSPVKEE EKPQEVKVKE 
851 PPKKAEEEKA PATPKTEEKK DSKKEEAPKK EAPKPKVEEK KEPAVEKPKE 
901 SKVEAKKEEA EDKKKVPTPE KEAPAKVEVK EDAKPKEKTE VAKKEPDDAK 
951 AKEPSKPAEK KEAAPEKKDT KEEKAKKPEE KPKTEAKAKE DDKTLSKEPS 
1001 KPKAEKAEKS SSTDQKDSKP PEKATEDKAA KGK 
HITS AT: 707-738, 749-766 

REFERENCE 1: 135:176460 

L4 ANSWER 15 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 355029-63-9 REGISTRY 

CN Protein (human clone 787CIP2B_152 ) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 795: PN: WO0157190 SEQID: 1487 claimed protein 
CI MAN 
SQL 1026 

SEQ 1 MMS FGGADAL 

51 TRTSVSSVSA 
101 LQALNDRFAG 
151 REMRGAVLRL 
201 ALARFAQEAE 
251 AQAQMQAETR 
301 SEAAKVNTDA 
351 DRHQADIASY 
401 AYRKLLEGEE 
451 EKETVIVEEQ 
501 TKSPPVEEAA 
551 KSPAKEEAKS 
601 PEKAKSPVKE 
651 KAKSPEKAKS 

701 KSPVKEEAKS 



751 KSPEKAKSPE 



801 SPEKAKSPLK 



LGAPFAPLHG GGSLHYALAR 
SPSRFRGAGA ASSTDSLDTL 
YIDKVRQLEA HNRSLEGEAA 
GAARGQLRLE QEHLLEDIAH 
AARVDLQKKA QALQEECGYL 
DALKCDVTSA LREIRAQLEG 
MRSAQEEITE YRRQLQARTT 
QEAIQQLDAE LRNTKWEMAA 
CRIGFGPIPF SLPEGLPKIP 
TEETQVTEEV TEEEEKEAKE 
SPEKEAKSPV KEEAKSPAEA 
PPEAKSPEKE EAKS PAEVKS 
EAKSPAEAKS PVKEEAKSPA 
PEKEEAKSPE KAKSPVKAEA 

PEKAKSPVKE EAKSPEKAKS 

KAKTLDVKSP EAKTPAKEEA 

EDAKAPEKEI PKKEEVKSPV 



KGGAGGTRSA AGSSSGFHSW 
SNGPEGCMVA VATSRSEKEQ 
ALRQQQAGRS AMGELYEREV 
VRQRLDDEAR QREEAEAAAR 
RRHHQEEVGE LLGQIQGSGA 
HAVQSTLQSE EWFRVRLDRL 
ELEALKSTKD SLERQRSELE 
QLREYQDLLN VKMALDIEIA 
SVSTHIKVKS EEKIKWEKS 
EEGKEEEGGE EEEAEGGEEE 
KSPEKEEAKS PAEVKS PEKA 
PEKAKSPAKE EAKSPAEAKS 
EVKSPEKAKS PTKEEAKSPE 
KSPEKAKSPV KAEAKSPEKA 

PVKEEAKTPE KAKSPVKEEA 



RSPADKFPEK AKS PVKEEVK 
KEEEKPQEVK VKEPPKKAEE 



Searcher 
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851 EKAPATPKTE EKKDSKKEEA PKKEAPKPKV EEKKEPAVEK PKESKVEAKK 
901 EEAEDKKKVP TPEKEAPAKV EVKEDAKPKE KTEVAKKEPD DAKAKEPSKP 
951 AEKKEAAPEK KDTKEEKAKK PEEKPKTEAK AKEDDKTLSK EPSKPKAEKA 
1001 EKSSSTDQKD SKPPEKATED KAAKGK 
HITS AT: 700-731, 742-759 

REFERENCE 1: 135:176460 

L4 , ANSWER 16 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 329020-48-6 REGISTRY 

CN L-Valinamide, L-alanyl-L-lysyl-O-phosphono-L-seryl-L-prolyl-L-valyl- 
L-lysyl-L-a-glutamyl-L-a-glutaniyl-L-alanyl-L-lysyl-O- 
phosphono-L-seryl-L-prolyl-L-a-glutamyl-L-lysyl-L-alanyl-L- 
lysyl-O-phosphono-L-seryl-L-prolyl- (9CI) (CA INDEX NAME) 

SQL 19 

SEQ 1 AKSPVKEEAK SPEKAKSPV 



HITS AT: 1-18 

REFERENCE 1: 134:204745 

L4 ANSWER 17 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 329020-47-5 REGISTRY 

CN L-Proline, L-alanyl-L-lysyl-O-phosphono-L-seryl-L-prolyl-L-valyl-L- 
lysyl-L-a-glutamyl-L-a-glutamyl-L-alanyl-L-lysyl-O- 
phosphono-L-seryl-L-prolyl-L-a-glutamyl-L-lysyl-L-alanyl-L- 
lysyl-O-phosphono-L-seryl- (9CI) (CA INDEX NAME) 

SQL 18 

SEQ 1 AKSPVKEEAK SPEKAKSP 

HITS AT: 1-18 

** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
REFERENCE 1: 134:204745 

L4 ANSWER 18 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 329020-46-4 REGISTRY 

CN L-Proline, L-alanyl-L-lysyl-L-seryl-L-prolyl-L-valyl-L-lysyl-L- 
a-glutamyl-L-a-glutamyl-L-alanyl-L-lysyl-O-phosphono-L- 
seryl-L-prolyl-L-a-glutamyl-L-lysyl-L-alanyl-L-lysyl-O- 
phosphono-L-seryl- (9CI) (CA INDEX NAME) 

SQL 18. 

SEQ 1 AKSPVKEEAK SPEKAKSP 



HITS AT: 1-18 

** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
REFERENCE 1: 134:204745 

L4 ANSWER 19 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 329020-45-3 REGISTRY 



Searcher : Shears 571-272-2528 



09/847586 



CN L-Proline, L-alanyl-L-lysyl-O-phosphono-L-seryl-L-prolyl-L-valyl-L 
lysyl-L-a-glutamyl-L-ct-glutamyl-L-alanyl-L-lysyl-L-seryl- 
L-prolyl-L-a-glutamyl-L-lysyl-L-alanyl-L-lysyl-O-phosphono-L- 
seryl- (9CI) (CA INDEX NAME) 

SQL 18 

SEQ 1 AKSPVKEEAK SPEKAKSP 



HITS AT: 1-18 

** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
REFERENCE 1: 134:204745 

L4 ANSWER 20 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 329020-44-2 REGISTRY 

CN L-Proline, L-alanyl-L-lysyl-O-phosphono-L-seryl-L-prolyl-L-valyl-L 
lysyl-L-a-glutamyl-L-a-glutamyl-L-alanyl-L-lysyl-O- 
phosphono-L-seryl-L-prolyl-L-a-glutamyl-L-lysyl-L-alanyl-L- 
lysyl-L-seryl- (9CI) (CA INDEX NAME) 

SQL 18 

SEQ 1 AKSPVKEEAK SPEKAKSP 



HITS AT: 1-18 

** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
REFERENCE 1: 134:204745 

L4 ANSWER 21 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 329020-43-1 REGISTRY 

CN L-Proline, L-alanyl-L-lysyl-L-seryl-L-prolyl-L-valyl-L-lysyl-L- 
a-glutamyl-L-a-glutarnyl-L-alanyl-L-lysyl-L-seryl-L- 
prolyl-L-a-glutamyl-L-lysyl-L-alanyl-L-lysyl-O-phosphono-L- 
seryl- (9CI) (CA INDEX NAME) 

SQL 18 

SEQ 1 AKSPVKEEAK SPEKAKSP 



HITS AT: 1-18 

** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
REFERENCE 1: 134:204745 

L4 ANSWER 22 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 329020-42-0 REGISTRY 

CN L-Proline, L-alanyl-L-lysyl-L-seryl-L-prolyl-L-valyl-L-lysyl-L- 
a-glutamyl-L-a-glutamyl-L-alanyl-L-lysyl-O-phosphono-L- 
seryl-L-prolyl-L-a-glutamyl-L-lysyl-L-alanyl-L-lysyl-L-seryl- 
(9CI) (CA INDEX NAME) 

SQL 18 

SEQ 1 AKSPVKEEAK SPEKAKSP 



Searcher 



Shears 571-272-2528 
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HITS AT: 1-18 

** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
REFERENCE 1: 134:204745 

L4 ANSWER 23 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 329020-41-9 REGISTRY 

CN L-Proline, L-alanyl-L-lysyl-O-phosphono-L-seryl-L-prolyl-L-valyl-L- 
lysyl-L-a-glutamyl-L-a-glutamyl-L-alanyl-L-lysyl-L-seryl- 
L-prolyl-L-a-glutamyl-L-lysyl-L-alanyl-L-lysyl-L-seryl- ( 9CI ) 
(CA INDEX NAME) 

SQL 18 

SEQ 1 AKSPVKEEAK SPEKAKSP 

HITS AT: 1-18 

** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
REFERENCE 1: 134:204745 

L4 ANSWER 24 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 329019-83-2 REGISTRY 

CN L-Proline, L-alanyl-L-lysyl-L-seryl-L-prolyl-L-valyl-L-lysyl-L- 
a-glutamyl-L-a-glutamyl-L-alanyl-L-lysyl-L-seryl-L- 
prolyl-L-ot-glutamyl-L-lysyl-L-alanyl-L-lysyl-L-seryl- ( 9CI ) 
(CA INDEX NAME) 

SQL 18 

SEQ 1 AKSPVKEEAK SPEKAKSP 

HITS AT: 1-18 

** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
REFERENCE 1: 134:204745 

L4 ANSWER 25 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 222963-40-8 REGISTRY 

CN Protein (human brain gene KIAA0845 C-terminal fragment) (9CI) (CA 

INDEX NAME) 
OTHER NAMES: 
CN GenBank BAA74868 

CN GenBank BAA74868 (Translated from: GenBank AB020652) 
CI MAN 
SQL 1034 

SEQ 1 NPISTAQAMM SFGGADALLG APFAPLHGGG SLHYALARKG GAGGTRSAAG 

51 SSSGFHSWTR TSVSSVSASP SRFRGAGAAS STDSLDTLSN GPEGCMVAVA 
101 TSRSEKEQLQ ALNDRFAGYI DKVRQLEAHN RSLEGEAAAL RQQQAGRSAM 
151 GELYEREVRE MRGAVL RLGA ARGQLRLEQE HLLEDIAHVR QRLDDEARQR 
201 EEAEAAARAL ARFAQ EAEAA RVDLQKKAQA LQEECGYLRR HHQEEVGELL 
251 GQIQGSGAAQ AQMQAETRDA LKCDVTSALR EIRAQLEGHA VQSTLQSEEW 
301 FRVRLDRLSE AAKVNTDAMR SAQEEITEYR RQLQARTTEL EALKSTKDSL 
351 ERQRSELEDR HQADIASYQE AIQQLDAELR NTKWEMAAQL REYQDLLNVK 



Searcher 



Shears 571-272-2528 
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401 MALDIEIAAY RKLLEGEECR 
451 KIKWEKSEK ETVIVEEQTE 
501 EAEGGEEETK SPPAEEAASP 
551 EVKSPEKAKS PAKEEAKSPP 
601 KSPAEAKSPE KAKSPVKEEA 
651 KEEAKSPEKA KSPEKAKSPE 
701 EAKSPEKAKS PVKEEAKSPE 



751 KSPVKEEAKS PEKAKSPEKA 

801 SPVKEEVKSP EKAKSPLKED 
851 EPPKKAEEEK APATPKTEEK 
901 ESKVEAKKEE AEDKKKVPTP 
951 KAKEPSKPAE KKEAAPEKKD 
1001 SKPKAEKAEK SSSTDQKDSK 
HITS AT: 708-739, 750-767 



IGFGPIPFSL PEGLPKIPSV STHIKVKSEE 
ETQVTEEVTE EEEKEAKEEE GKEEEGGEEE 
EKEAKSPVKE EAKSPAEAKS PEKEEAKSPA 
EAKSPEKEEA KSPAEVKSPE KAKSPAKEEA 
KSPAEAKSPV KEEAKSPAEV KSPEKAKSPT 
KEEAKSPEKA KSPVKAEAKS PEKAKSPVKA 
KAKSPVKEEA KSPEKAKSPV KEEAKTPEKA 



KTLDVKSPEA KT PAKE EARS PADKFPEKAK 

AKAPEKEIPK. KEEVKSPVKE EEKPQEVKVK 
KDSKKEEAPK KEAPKPKVEE KKEPAVEKPK 
EKEAPAKVEV KEDAKPKEKT EVAKKEPDDA 
TKEEKAKKPE EKPKTEAKAK EDDKTLSKEP 
PPEKATEDKA AKGK 



REFERENCE 1: 130:292252 



L4 ANSWER 26 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 147388-28-1 REGISTRY 

CN Phosphoprotein NF-H (rabbit isoform 1 C-terminal fragment) (9CI) 

(CA INDEX NAME) 
CI MAN 
SQL 606 



SEQ 1 EKETVIVEEQ TEEIQVTEEV TEEEEKEAKE EEGGEEEEAK SPTEGGAASP 

51 EEEAKSPAEA KSPVKEEAKS PAEAKSPAEA KSPAEAKSPE KAKSPVKEEA 



101 KSPEKAKSPV KEEAKSPAEA KSPEKAKSPA EAKSPEKAKS PVKEEAKSPE 



151 KAKSPAEAKS PEKAKSPAEA KSPEKAKSPV KEEAKSPEKA KSPVKEEAKS 



201 PAEAKSPEKA KSPVKEEAKS PEKAKSPAEA KSPVKEEAKS PEKAKSPEKE 
251 EAKSPAEAKS PEKAKSPEKA KSPVEVKSPA EAKSPEKAKS PVKEEAKSPE 



301 KAKSPVKEEA KSPEKAKSPV KEEAKSPEKA KSPVKEEAKS PEKAKSPVKE 



351 EAKSPEKAKS PVKEEAKSPE KAKSPEKAKS PVKEEAKSPE KAKSPVKEEA 
401 KSPEKAKSPV KEEAKSPEKE TPKKEEVKVK EPPKKVEETA PAPPKVEKDS 



451 KKDEAPKKEA PKPAVEKPKE STAEAKKDEA EDKKKAAPAK MEGKEEAKPK 
501 EKTEVAKKEP EDAKAKEPSK PTEKEPEKPK KEETPAAPVK KEAKEEARKP 
551 EEKPKTEAKA KEDDKALSKE PSKPKTEKAE KSSSTDQKDS RPPEKATEDK 
601 AAKGEK 

HITS AT: 92-109, 138-155, 176-193, 210-227, 230-247, 288-375, 
378-409 



** RELATED SEQUENCES AVAILABLE WITH SEQLINK** 



REFERENCE 1: 118:206731 



L4 ANSWER 27 OF 27 REGISTRY COPYRIGHT 2004 ACS on STN 
RN 119213-37-5 REGISTRY 
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Phosphoprotein NF-H (human clone HW10/HW12 subunit protein moiety 

reduced) (9CI) (CA INDEX NAME) 

MAN 

1054 

1 MMS FGGADAL LGAPFAPLHG GGSLHYALAR KGGAGGTRSA AGSSSGFHSW 
51 TRTSVSSVSA SPSRFRGAGA ASSTDSLDTL SNGPEGCMVA VATSRSEKEQ 
101 LQALNDRFAG YIDKVRQLEA HNRSLEGEAA ALRQQQAGRS AMGELYEREV 
151 REMRGAVLRL GAARGQLRLE QEHLLEDIAH VRQRLDDEAR QREEAEAAAR 
201 ALARFAQEAE AARVDLQKKA QALQEECGYL RRHHQEEVGE LLGQIQGSGA 
251 AQAQMQAETR DALKCDVTSA LRE IRAQ LEG HAVQSTLQSE EWFRVRLDRL 
301 SEAAKVNTDA MRSAQEEITE YRRQLQARTT ELEALKSTKD SLERQRSELE 
351 DRHQADIASY QEAIQQLDAE LRNTKWEMAA QLREYQDLLN VKMALDIEIA 
401 AYRKLLEGEE CRIGFGPIPF SLPEGLPKIP SVSTHIKVKS EEKIKWEKS 
451 EKETVIVEEQ TEETQVT,EEV TEEEEKEAKE EEGKEEEGGE EEEAEGGEEE 
501 TKSPPAEEAA SPEKEAKSPV KEEAKSPAEA KSPEKEEAKS PAEVKSPEKA 
551 KSPAKEEAKS P PEAKS P EKE EAKSPAEVKS PEKAKSPAKE EAKSP PEAKS 
601 PEKEEAKSPA EVKSPEKAKS PAKEEAKSPA EAKSPEKAKS PVKEEAKSPA 
651 EAKSPVKEEA KSPAEVKSPE KAKSPTKEEA KSPEKAKSPE KEEAKSPEKA 
701 KSPVKAEAKS PEKAKSPVKA EAKSPEKAKS PVKEEAKSPE KAKSPVKEEA 



751 KSPEKAKSPV KEEAKTPEKA KSPVKEEAKS PEKAKSPEKA KTLDVKSPEA 



801 KT PAKE EARS PADKFPEKAK SPVKEEVKSP EKAKSPLKAD AKAPEKEIPK 
851 KEEVKSPVKE EEKPQEVKVK EPPKKAEEEK APATPKTEEK KDSKKEEAPK 
901 KEAPKPKVEE KKEPAVEKPK ESKVEAKKEE AEDKKKVPTP EKEAPAKVEV 
951 KEDAKPKEKT EVAKKEPDDA KAKEPSKPAE KKEAAPEKKD TKEEKAKKPE 
1001 EKPKTEAKAK EDDKTLSKEP SKPKAEKAEK SSSTDQKDSK PPEKATEDKA 
1051 AKGK 
HITS AT: 728-759, 770-787 

REFERENCE 1: 110:89623 

=> fil horn 

FILE ' HOME 1 ENTERED AT 11:49:53 ON 26 MAY 2004 
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